Pesquisa | Portal Regional da BVS

Correction: G-bic: generating synthetic benchmarks for biclustering.

Castanho, Eduardo N; Lobo, João P; Henriques, Rui; Madeira, Sara C.

BMC Bioinformatics ; 25(1): 16, 2024 Jan 11.

Artigo em Inglês | MEDLINE | ID: mdl-38212689

G-bic: generating synthetic benchmarks for biclustering.

Castanho, Eduardo N; Lobo, João P; Henriques, Rui; Madeira, Sara C.

BMC Bioinformatics ; 24(1): 457, 2023 Dec 06.

Artigo em Inglês | MEDLINE | ID: mdl-38053078

RESUMO

BACKGROUND: Biclustering is increasingly used in biomedical data analysis, recommendation tasks, and text mining domains, with hundreds of biclustering algorithms proposed. When assessing the performance of these algorithms, more than real datasets are required as they do not offer a solid ground truth. Synthetic data surpass this limitation by producing reference solutions to be compared with the found patterns. However, generating synthetic datasets is challenging since the generated data must ensure reproducibility, pattern representativity, and real data resemblance. RESULTS: We propose G-Bic, a dataset generator conceived to produce synthetic benchmarks for the normative assessment of biclustering algorithms. Beyond expanding on aspects of pattern coherence, data quality, and positioning properties, it further handles specificities related to mixed-type datasets and time-series data.G-Bic has the flexibility to replicate real data regularities from diverse domains. We provide the default configurations to generate reproducible benchmarks to evaluate and compare diverse aspects of biclustering algorithms. Additionally, we discuss empirical strategies to simulate the properties of real data. CONCLUSION: G-Bic is a parametrizable generator for biclustering analysis, offering a solid means to assess biclustering solutions according to internal and external metrics robustly.

Assuntos

Benchmarking , Perfilação da Expressão Gênica , Reprodutibilidade dos Testes , Análise por Conglomerados , Algoritmos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA